
Meeting 2 (2020-02-29)

Attendance

Recap

Teams and Tasks

We are lucky to have access to source code, data, and pretrained models which the paper authors and other researchers have released.

However, we also face a challenge: the network takes a long time to train. It’s unclear at this point how much 2-GPU training time is needed to replicate the pretrained checkpoint, which used either 1152 or 3456 GPU hours (the GitHub repository and the paper disagree on the exact number). The long training time will also make it harder to test our re-implementation and confirm that it works correctly.
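For a rough sense of scale, the two reported GPU-hour figures can be converted to wall-clock time on 2 GPUs. This is only a back-of-the-envelope sketch: it assumes perfect linear scaling and that our GPUs are comparable to the ones used for the checkpoint, neither of which we have confirmed. The helper name `wall_clock_days` is made up for illustration.

```python
def wall_clock_days(gpu_hours: float, num_gpus: int) -> float:
    """Convert a GPU-hour budget to wall-clock days.

    Assumption: training scales perfectly linearly with the number of GPUs
    and our GPUs match the original hardware. Real numbers will differ.
    """
    if num_gpus <= 0:
        raise ValueError("num_gpus must be positive")
    return gpu_hours / num_gpus / 24


# The two figures reported for the pretrained checkpoint.
for gpu_hours in (1152, 3456):
    days = wall_clock_days(gpu_hours, num_gpus=2)
    print(f"{gpu_hours} GPU hours on 2 GPUs: about {days:.0f} days")
```

Under these assumptions, a full replication would take roughly 24 or 72 days of continuous training, which is why we can't count on retraining from scratch to validate the re-implementation.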

Because of this, we will work on different tasks in parallel. We will split into 3 groups of 2, each working on a different task. Every 2 weeks, there will be an opportunity to switch groups so that everyone can try different tasks over the semester.

Team assignments for the next 2 weeks were decided at the meeting:

  1. (James, Alicia) Understanding and re-implementing the UMTN architecture.
  2. (Praveen, Andrew) Experiments related to training the UMTN.
  3. (Jasmine, Jim) Experiments related to evaluating the UMTN.

What to do by next meeting

If you missed the meeting:

Reimplementation team:

Training experiments team:

Evaluation experiments team:

Presenter for GM: